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An intracellular defective-interfering (DI) RNA, DissE, of mouse hepatitis virus (MHV) obtained after serial high multi- 
plicity passage of the virus was cloned and sequenced. DissE RNA is composed of three noncontiguous genomic 
regions, representing the first 864 nucleotides of the 5’ end, an internal 748 nucleotides of the polymerase gene, and 
601 nucleotides from the 3’ end of the parental MHV genome. The DIssE sequence contains one large continuous open 
reading frame. Two protein products from this open reading frame were identified both by in vitro translation and in DI- 
infected cells. Sequence comparison of DIssE and the corresponding parts of the parental virus genome revealed that 
DissE had three base substitutions within the leader sequence and also a deletion of nine nucleotides located at the 
junction of the leader and the remaining genomic sequence. The 5’ end of DissE RNA was heterogeneous with respect 
to the number of UCUAA repeats within the leader sequence. The parental MHV genomic RNA appears to have exten- 
sive and stable secondary structures at the regions where DI RNA rearrangements occurred. These data suggest that 
MHV DI RNA may have been generated as a result of the discontinuous and nonprocessive manner of MHV RNA 


synthesis. © 1988 Academic Press, Inc. 


INTRODUCTION 


Mouse hepatitis virus (MHV), a member of the Coro- 
naviridae, contains a single-stranded, positive-sense 
RNA of approximately 6 X 10® Da (Lai and Stohlman, 
1978; Wege eta/., 1978). In infected cells, the genomic 
RNA of MHV is first translated into an RNA-dependent 
RNA polymerase (Brayton et a/., 1982, 1984; Mahy et 
al., 1983) which is responsible for the synthesis of a 
genomic-sized negative-stranded RNA (Lai et ai, 
1982b). The negative-stranded RNA then serves as the 
template for the synthesis of six subgenomic and a 
genomic-sized mRNA (Brayton et a/., 1984; Lai et a/., 
1982b). These mRNAs are arranged in the form of a 
3’ coterminal ‘‘nested”’ set, i.e., the sequence of each 
mRNA ‘ts contained entirely within the next larger 
MRNA (Lai et a/., 1981; Leibowitz er a/., 1981). In addi- 
tion, each mRNA has a common leader sequence, 
which is derived from the 5’ end of the genome (Lai et 
a/., 1982a, 1983, 1984; Spaan et a/., 1983). Several 
pieces of evidence demonstrated that MHV utilizes a 
novel mechanism of leader RNA-primed transcription, 
in which a free leader RNA species derived from the 
5’ end of genomic RNA is utilized as a primer for the 
transcription of subgenomic mRNAs (Baric et a/., 1983, 
1985; Makino et a/., 1986b). 

Another unusual feature of coronavirus RNA synthe- 
sis is that the virus undergoes RNA-RNA recombina- 
tion at a very high frequency (Makino et a/., 1986a). The 
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unusually high frequency, approaching 10% under 
some circumstances (Makino et a/., 1986a), of coro- 
navirus RNA recombination suggests that discontinu- 
ous RNA transcripts might be generated during coro- 
navirus RNA synthesis. These incomplete RNA inter- 
mediates may rejoin the original or different RNA 
template to continue RNA synthesis, resulting in RNA 
recombination in the latter case. The detection of such 
RNA intermediates in MHV-infected cells (Baric et a/., 
1985, 1987) suggests that coronavirus genomic RNA 
synthesis involves a discontinuous and nonprocessive 
mechanism, which may account for the high frequency 
of recombination via a copy choice mechanism. 

Defective-interfering (DI) particles are naturally oc- 
curing deletion mutants that have been described for 
many virus groups. Characteristically, DI particles (a) 
lack part of the viral genome, (b) contain normal viral 
structural proteins, (c) replicate only with the aid of a 
helper standard virus, and (d) interfere with replication 
of homologous standard virus. Deletion of genomic se- 
quence can occur in various regions of the genome; 
however, all of the DI RNAs apparently retain signals 
for RNA replication since they can be replicated in the 
presence of helper virus. The generation of DI RNA can 
be viewed as the result of abnormal RNA replication or 
illigitimate RNA recombination. Therefore, the struc- 
ture of DI RNA is of particular interest in elucidation of 
the mechanism of viral RNA replication and recombina- 
tion. 

We have previously reported the generation of DI 
particles during high multiplicity passages of the JHM 
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Fic. 1. Intracellular RNA species in Dl-infected cells. **P-Labeled 
RNA from MHV-JHM-infected cells (a) and DI particles-infected cells 
(b) were electrophoresed in a 1% agarose gel without denaturation. 
Numbers 1, 2, 3, 6, and 7 represent the major MHV-JHM-specific 
mRNA species. 


strain of MHV (MHV-JHM) (Makino et a/., 1984a). In DI- 
infected cells, the synthesis of most of the standard 
viral MRNAs is inhibited. Instead, three distinct virus- 
specific RNA species could be detected (Makino et a/., 
1985) (Fig. 1). The first species, DIssA, is equivalent to 
DI virion RNA in length and is eventually incorporated 
into virus particles. This RNA differs from the standard 
virus genome in that it contains multiple deletions dis- 
tributed throughout the genome, except for the 5’ and 
3’ ends of the genomic RNA (Makino et a/., 1985), 
which encode RNA polymerase (gene A) and nucleo- 
capsid (N) protein, respectively. Surprisingly, DissA 
RNA can replicate by itself in the absence of helper vi- 
rus infection, suggesting that DIssA codes for func- 
tional RNA polymerases (Makino et a/., 1988). Thus, 
DissA is not a defective RNA in a strict sense. The sec- 
ond major RNA species found in Dl-infected cells is in- 
distinguishable from the mRNA 7 made by the standard 
virus. The synthesis of this mRNA and its product N 
protein is not inhibited in Dl-infected cells. The third 
RNA species is a novel single-stranded polyadenylated 
DI RNA species of varying size. Oligonucleotide fin- 
gerprinting studies suggest that it represents se- 


quences derived from various noncontiguous parts of 
the genome. The size of this RNA varies with the DI 
passage level (Makino et a/., 1985). One of these 
RNAs, DissE, which is the smallest DI RNA detected, 
has been analyzed in greater detail (Makino et a/., 
1988). In contrast to DilssA, DIssE RNA synthesis re- 
quires helper virus coinfection (Makino et a/., 1988). 
Only a trace amount of it is incorporated into virus parti- 
cles to serve as a template for RNA replication (Makino 
et a/., 1988). Thus, it may lack packaging signals. On 
the other hand, since it is efficiently replicated in DI- 
infected cells, DIssE RNA must contain the sequences 
essential for viral RNA replication. 

In the present study, we analyzed the primary struc- 
ture of DissE RNA. The results revealed that DIssE con- 
sists of three noncontiguous regions of MHV-JHM ge- 
nomic RNA, including 5’ end leader RNA and the 3’ end 
of genomic RNA. One large open reading frame (ORF) 
was demonstrated and the product of this ORF was 
identified both in infected cells and by in vitro transla- 
tion. Possible mechanisms of DI RNA generation are 
discussed. 


MATERIALS AND METHODS 
Viruses and cell culture 


MHV-JHM was used as a nondefective standard vi- 
rus. Serially passaged MHV-JHM stock at passage 
level 17 was used as the source of DI particles (Makino 
et al., 1985). All viruses were propagated in DBT cells 
as described previously (Makino et a/., 1984a). 


Preparation of virus-specific intracellular RNA 


MHV-specific intracellular RNA was extracted by 
procedures described previously (Makino et a/., 
1984b). Poly(A)-containing RNA was obtained by oli- 
go(dT)-cellulose column chromatography (Makino et 
al., 1984b). 


Agarose gel electrophoresis 


32P_Labeled virus-specific RNA was analyzed by 
electrophoresis on 1% agarose gels without dena- 
turing as described previously (Makino et a/., 1988). 
Poly(A)-containing RNA was purified by preparative gel 
electrophoresis in 1% urea—agarose gels as previously 
described (Makino et a/., 1984a). The RNA was eluted 
from gel slices by the methods of Langridge et ai., 
(1980). 


cDNA cloning of DissE 


cDNA cloning followed the general method of Gubler 
and Hoffman (1983). Five hundred nanograms of oli- 
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go(dT),2-1g was mixed with 2 wg of gel-purified DIssE 
RNA in 10 ul of distilled water. The RNA and oligo(dT) 
mixture was heated at 70° for 3 min and chilled quickly. 
The RNA-DNA hybrid was then incubated in 50 ul of 
first-strand cDNA synthesis buffer containing 60 units 
of RNasin (Promega Biotec), 50 mM Tris—HCl (pH 8.3 
at 42°),100 mM KCl, 10 mM MgCl,, 10 mM DTT, 1.25 
mM each of dATP, dCTP, dGTP, and TTP, and 20 units 
of avian myeloblastosis virus reverse transcriptase (Life 
Science) at 42° for 1 hr. The cDNA synthesis was 
stopped by adding 4.4 ul of 250 mM EDTA. Nucleic 
acids were extracted with phenol-chloroform—isoamy| 
alcohol and precipitated with ethanol. 

Second-strand synthesis was carried out in a reac- 
tion volume of 100 ul containing 20 mM Tris-HCl (pH 
7.4), 5 mM MgCl., 100 mM KCI, 50 ug/ml of BSA, 10 
mM (NH,)2SO,, 0.15 mM B-NAD, 100 uM dNTPs, 25 
units of DNA polymerase |, 2 units of Escherichia coli 
DNA ligase, 0.8 units of RNase H, and the product from 
the first strand reaction. The mixture was incubated at 
12° for 1 hr, and then at 22° for 1 hr. The reaction was 
stopped by adding 8.7 ul of 250 mM EDTA, and prod- 
ucts were extracted with phenol—chloroform—isoamyl 
alcohol, and precipitated with ethanol. Double- 
stranded DNA was dC-tailed in a 12-ul reaction mixture 
containing 10 units of terminal transferase, 200 mM 
potassium cacodylate, 0.56 mM CoCl,, 25 mM Tris— 
HCI (pH 6.9), 2 mM DTT, 250 ug/ml BSA, and 50 pM 
dCTP at 37° for 4 min. The dC-tailed double-stranded 
DNA was annealed to 200 ng of dG-tailed Pstl-cut 
pBR322 plasmid in 20 ul of a buffer containing 10 mM 
Tris-HCl (pH 7.4), 100 mM NaCl, and 0.25 mM EDTA. 
The DNA mixture was heated at 68° for 5 min and then 
cooled slowly overnight for annealing. The annealed 
molecules were used to transform E. coli MC1061 as 
described (Dagert and Ehrlich, 1979). 


Identification of large cDNA clones containing DIssE 
sequence 


32P_Labeled MHV-JHM gene A cDNA clones C96 
and F82 (Shieh et a/., 1987) and 5’ end °*P-labeled lead- 
er-specific 72-mer derived from leader sequence of 
MHV (Lai et a/, 1984) were used for colony hybridiza- 
tion (Shieh et a/., 1987) to isolate DlissE-specific cDNA 
clones. Colonies yielding a strong signal were further 
analyzed by Southern hybridization (Maniatis et a/., 
1982). 


Primer extension 


The gel-purified RNAs were incubated in 8 ul of dis- 
tilled water containing 10 mM methyl mercury. After 10 
min incubation at room temperature, RNA was incu- 


bated in 50 ul of first-strand cDNA synthesis buffer with 
28 mM £B-mercaptoethanol and 5’ end-labeled oligo- 
deoxyribonucleotides at 42° for 1 hr. Reaction prod- 
ucts were extracted with phenol-chloroform—isoamy| 
alcohol, precipitated with ethanol, and analyzed by 
electrophoresis on 6% polyacrylamide gels containing 
8.3 M urea and were eluted from the gels according to 
the published procedures (Maxam and Gilbert, 1980). 


DNA sequencing 


Sequencing was carried out by Sanger’s dideoxyri- 
bonucleotide chain termination method (Sanger et a/., 
1977) and Maxam-—Gilbert chemical modification pro- 
cedure (Maxam and Gilbert, 1980), as described pre- 
viously (Soe et a/., 1987). Sequence analysis and pre- 
dicted RNA secondary structures were obtained with 
the Intelligenetics sequencing program. 


/n vitro translation 


An mRNA-dependent rabbit reticulocyte lysate (New 
England Nuclear) was used as previously described 
(Soe et a/., 1987). 


Antisera 


A monoclonal antibody, J.3.3, directed against the 
MHV-JHM N protein has been described (Fleming et 
a/., 1983). The anti-p28 antibody was generated in rab- 
bits against a synthetic peptide representing a portion 
of the MHV-JHM p28 protein (Soe et a/., 1987) and will 
be described in detail elsewhere (S. C. Baker ef a/., 
manuscript in preparation). 


Labeling of intracellular proteins, 
immunoprecipitation, and SDS—polyacrylamide 
gel electrophoresis 


DBT cells were infected with either wild type MHV- 
JHM or MHV-JHM containing DI particles at 2 PFU per 
cell. At 7.5 hr postinfection, cells were labeled in methi- 
onine-free medium containing 30 uCi of L-[3°S]methio- 
nine/mi (ICN translabel) for 30 min. Cell extracts were 
prepared by treatment with lysolecithin (L-a-lysophos- 
phatidylcholine, palmitoyl; Sigma) at 125 wg/mi for 1 
min at 4°. The treated cells were scraped in 300 ul HND 
buffer (0.1 M HEPES, pH 8.0, 0.2 M NH,CI, 0.005 M@ 
DTT), disrupted by pipetting with a Pastuer pipet, and 
then centrifuged at 800 g for 5 min to remove nuclei 
and cell debris. The resulting supernatant was used for 
immunoprecipitation. 

Immunoprecipitation was performed by the methods 
of Kessler (1981). The cell-free extracts were incubated 
with 3 yl of antisera for 4 hr at 4°. The antigen-antibody 
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complexes were collected by binding to Pansorbin 
(Calbiochem, La Jolla, CA) and washed three times 
with washing buffer (560 mM Tris-HCl, pH 7.4, 150 mM 
NaCl, 5 mM EDTA, and 0.5% NP-40) and eluted by boil- 
ing for 2 min in electrophoresis sample buffer (0.1 M p- 
mercaptoethanol, 1% SDS, 0.08 M Tris-HCl, pH 6.8, 
and 10% glycerol). The bacteria were removed by cen- 
trifugation and proteins were analyzed by electrophore- 
sis on 5 to 15% SDS-polyacrylamide gels (Laemmii, 
1970). 


RESULTS 
cDNA cloning and sequencing of DissE RNA 


To understand the primary structure of DIssE RNA, 
DIssE-specific cDNA clones were generated accord- 
ing to the general method of Gubler and Hoffman 
(1983), using oligo(dT) as a primer and gel-purified 
DIssE RNA. Since previous oligonucleotide fingerprint- 
ing analysis suggested that DIssE RNA contains the 
leader sequence and the 5’ end region of genomic se- 
quence (Makino et a/., 1985), cDNA clones were 
screened by colony hybridization using 5’ end-labeled, 
leader-specific 72-mer, and two cDNA clones F82 and 
C96, which correspond to the 5’ end of genomic RNA 
of MHV-JHM (Shieh et a/., 1987). Several large cDNA 
clones were isolated and their structure was further an- 
alyzed. A diagram representing the structure of the 
DIssE genome and that of MHV-JHM genomic RNA 
and the strategy used for sequencing the cDNA clones 
are shown in Fig. 2. The DIssE sequence obtained is 
shown in Fig. 3. 

Sequence analysis of DIssE cDNA clones revealed 
that DIssE RNA consists of three different regions of 
MHV-JHM genomic RNA. The first region represents 
864 nucleotides from the 5’ end of the genomic RNA. 
The second region, 748 nucleotides in length, is a re- 
gion within the polymerase gene that corresponds to 
the region at 3.3 to 4 kb from the 5’ end of genomic 
RNA (Shieh, unpublished observation), and the third re- 
gion contains a sequence of 601 nucleotides derived 
from the extreme 3’ end of the genomic RNA. The entire 
sequence of DIssE RNA is identical to that of the corre- 
sponding regions of MHV genomic RNA (Skinner and 
Siddell, 1983; Soe et a/., 1987; Shieh et a/., unpub- 
lished data), with some exceptions in the leader se- 
quence region (see below). 

The cDNA clones obtained does not appear to have 
a complete sequence at its extreme 5’ end. To under- 
stand the complete 5’ end sequence of DIssE, we per- 
formed primer-extension studies on DIssE RNA using 
a specific primer (5-AATGTCAGCACTATGACA-3') 
complementary to nucleotides 123-140 from the 5’ 


end of the genome of MHV-JHM (Shieh et a/., 1987). 
The 5’ end-labeled primer was hybridized to gel-purified 
DissE RNA and extended with reverse transcriptase. 
Primer extension products were then analyzed by elec- 
trophoresis on 6% polyacrylamide gels containing 8 M 
urea. As shown in Fig. 4A, two cDNA products of 136 
and 131 nucleotides were obtained, indicating hetero- 
geneity at the 5’ end sequence of DIssE. These primer- 
extended products were sequenced by the Maxam-— 
Gilbert method. The sequences of both cDNA prod- 
ucts were identical except that the faster migrating 
cDNA products contained three UCUAA repeats at the 
3’ end of the leader sequence, while the slower migrat- 
ing species contained four UCUAA repeats (Fig. 4B). In 
addition, the 5’end sequences of DIssE and MHV-JHM 
genomic RNA showed several differences. Within the 
leader sequence, 3 bases were substituted in DissE 
RNA (Fig. 4B, asterisks) and nine nucleotides (UUUAU- 
AAAC) were deleted in DIssE at the junction between 
the leader RNA and the remaining genomic se- 
quences. The significance of the heterogeneity in the 
number of UCUAA repeats and of the nine-nucieotide 
deletion will be discussed below. 


Translation of DIssE RNA in vitro and in vivo 


Another significant feature of DissE RNA is the pres- 
ence of a single large ORF (Fig. 3). This ORF is ex- 
pected to share amino acid sequence identity with 
three different regions of the standard MHV-JHM. The 
first 218 amino acids correspond to the N terminus of 
the MHV polymerase. This region represents the part 
of the N-terminus of the polymerase protein which is 
cleaved into a p28 protein (Denison and Perlman, 
1986; Soe et a/., 1987). The following 250 amino acids 
were derived from the region of the polymerase at 3.3 
to 4 kb from the 5’ end of the genome. The 3’ end region 
of the ORF of DIssE RNA is the same as the ORF uti- 
lized for the N protein (Skinner and Siddell, 1983). Thus, 
the predicted product of this ORF should contain the 
N-terminus of p28 and the C-terminus of the N protein. 
The predicted molecular weight mass of this ORF prod- 
uct is 62,538. 

To examine whether the ORF of DissE RNA is utilized 
for translation, we first performed in vitro translation in 
a rabbit reticulocyte lysate of DIssE RNA purified from 
the Dl-infected cells. Two proteins with an apparent 
molecular mass of approximately 88,000 (88K) and 
79,000 (79K) were detected (Fig. 5A). Both proteins 
were immunoprecipitated with anti-N protein mono- 
clonal antibody and anti-p28 antibody (Fig. 5A, lanes 2 
and 3). Therefore, these two proteins were likely the 
translation products of DIssE RNA. A minor band of ap- 
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Fic. 2. Diagram of the structure of DissE RNA and the strategy used for sequencing DissE cDNA clones. (a) A comparison between the 
sequence of DissE RNA and that of the standard MHV-JHM genomic RNA. A-G represent the seven genes of MHV (Lai et a/., 1981). (b) Structure 
of DissE-specific cDNA clones. (c) Strategy for sequencing of DissE. Arrows starting with solid circles indicate DNA sequenced by the Maxam— 
Gilbert chemical method with 3’ end-labeled DNA. Arrows starting with open circles indicate DNA sequenced by the dideoxy method. 


proximately 60 kDa had the same electrophoretic mo- 
bility as the N protein of MHV-JHM, and was precipi- 
tated with anti-N monoclonal antibody, but not with 
anti-p28 antibody (Fig 5A, lanes 2 and 3). Thus, this 
protein is most likely the N protein translated from the 
contaminated mRNA 7 in the DissE RNA preparation. 
The synthesis of DissE-specific protein in Dl-infected 
cells was then examined. DBT cells were mock-in- 
fected (Fig. 5B, lanes 1 and 4), infected with MHV-JHM 
(Fig. 5B, lanes 2 and 5), or infected with MHV-JHM con- 
taining DI particles (Fig. 5B, lanes 3 and 6). Both 88K 
and 79K proteins were specifically immunoprecipitated 
with anti-N monoclonal antibody and anti-p28 antibody 
from Dl-infected cells. The amount of these two pro- 
teins was low as compared to the N protein. Neverthe- 
less, they were reproducibly detected in Dl-infected 
cells. Thus, the DissE RNA is a functional MRNA. The 
relationship between the two protein species detected 
is not clear. The discrepancy between the predicted 
and observed molecular weights of the translation 


products of DIssE could be due to post-translational 
modification of the protein or aberrant migration of the 
protein. A small amount of p28 was immunoprecipi- 
tated with anti-p28 antibody in MHV-JHM-infected cells 
(Fig. 5B, lane 5). However, this protein was hardly de- 
tectable in Dl-infected cells (Fig. 5B, lane 6). The ab- 
sence of detectable amount of p28 in Dl-infected cells 
may be due to the inhibition of MHV-JHM genomic RNA 
synthesis by DI particles (Makino et a/., 1985). 


Possible secondary structure at the DI RNA 
rearrangment sites 


Sequence analysis revealed that DIssE RNA con- 
sisted of three noncontiguous regions of MHV-JHM ge- 
nomic RNA. We have previously proposed that coro- 
navirus RNA synthesis proceeds by a discontinuous, 
nonprocessive mechanism, being interrupted at sites 
with hairpin loops (Baric et a/., 1987). This transcrip- 
tional interruption could account for the generation of 
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5 '-TATAAGAGTGAATGGCGTCCGTACGTACCCAATCTACTCTAAAACTCTTGTAGTTTAAATCTAATCTAATCTAATCTAAACGGC 


MPVGLVULSs @ 
ACTTCCTGCGTGTCCATGCCCGTGGGCCTGGTCTTGTCATAGTGCTGACATTTGTGGTTCCTTGACTTTCTGTCTCTGCCAGTG 


MA KM GK Y GLGFKWQ€A 
ACGTGTCCATTCGGCGCCAGCAGCCCACCCATAGGTTGCATAATGGCAAAGATGGGCAAATACGGTCTCGGCTTCAAATGGGCC 


P EF P WM L PNA S EK LGN PE RS EE DG FC P § 
CCAGAATTTCCATGGATGCTTCCGAACGCATCGGAGAAGTTGGGTAACCCTGAGAGGTCAGAGGAGGATGGGTTTTGCCCCTCT 


A AQ EP K V K GK T LVN HV RVD€C S RLDPALEC 
GCTGCGCAAGAACCGAAAGTTAAAGGAAAAACTTTGGTTAATCACGTGAGGGTGGATTGTAGCCGGCTTCCAGCCTTGGAGTGC 


cv Q S$ ATIdI RODI F V DED PQK V EAS TMM AL Q 
TGTGTTCAGTCCGCCATAATCCGTGATATTTTTGTTGACGAGGATCCCCAGAAGGTGGAGGCCTCGACTATGATGGCATTGCAG 


F GS Av LvK PS KR LS&VQAWAK LGVLP K T P 
TTCGGTAGTGCTGTCTTGGTCAAGCCATCCAAGCGCTTGTCTGTTCAGGCATGGGCTAAGTTGGGTGTGCTGCCTAAAACTCCG 


AM GULF K R FC LCN T R ECvVC DA HVA FQUL F T 
GCCATGGGGTTGTTCAAGCGCTTCTGCCTGTGTAACACCAGGGAGTGCGTTTGTGACGCCCACGTGGCCTTTCAACTTTTTACG 


vQPODBGVcCLGNGRFIGWFEVPV TA IP E Y A XK 
GTCCAGCCCGATGGTGTATGCCTGGGTAACGGCCGTTTTATAGGCTGGTTCGTTCCAGTCACAGCCATACCGGAGTATGCGAAG 


QwteL.tegQewWs ILL RK GGNK GS VTS G HF RRA V 
CAGTGGTTGCAACCCTGGTCCATCCTTCTTCGTAAGGGTGGTAACAAAGGGTCTGTGACATCCGGCCATTCCCGCCGCGCTGTT 


T MPV ¥ DF NAT DV VY ADEN QODODDAD DPV V 
BOCA GC CEST TA ONE TT LAR TOCA CARNE SE TG LNT AT ECCT I DAREC ARES LGRTGRTGCTSRCSETCCTSTAGTY 


LV AODTQEEDGVAREQVODS ADS ETcCcVA RH T 
CTTGTCGCCGATACCCAAGAAGAGGACGGCGTTGCCAGGGAGCAGGTTGATTCGGCTGATTCGGAAATTTGTGTTGCGCACACT 


GGQEMtTEPODVVGS& QT PIAS AEET EV GEA 
GTTGGTCAAGAAATGACTGAGCCTGATGTCGTCGGATCTCAAACTCCCATCGCCTCTGCTGAGGAAACCGAAGTCGGTGAGGCA 


Cc DREGtHIAEVK ATV CAD ALODACPODQVEA F 
TGCGACAGGGAAGGGATTGCTGAGGTCAAGGCAACTGTGTGTGCTGATGCTTTAGATGCCTGCCCCGATCAAGTGGAGGCATTT 


DIEK V EoD Ss ILS ELB QT EL N AP AOD K TF Y ED V 
GATATTGAAAAGGTTGAAGACAGTATCTTAAGTGAGCTTCAAACCGAACTTAATGCGCCCGCGGACAAGACCTATGAGGATGTC 


L AF DA IY $§ ET LS AF Y AVP S DE T H F KV CG 
TTGGCATTCGATGCCATATACTCAGAGACGTTGTCTGCATTCTATGCTGTGCCGAGTGATGAGACGCACTTTAAAGTGTGTGGA 


F ¥ S$ P AIT ERTNCWLRS TLI VM QS L PLE F K 
TTCTATTCGCCAGCTATAGAGGCTACTAATTGTTGGCTGCGTTCTACTTTGATAGTAATGCAGAGITTACCTTTGGAATTTAAA 


DLGM@oQkK LWLsS ¥ K AG YY DQ C F V DK LV K SS A P 
GACTTGGGGATGCAAAAGCTCTGGTTGTCTTACAAGGCTGGCTATGATCAATGCTTTGTGGACAAACTAGTTAAGAGCGCGCCC 


K S$ I ILPOG GY VAD F AY F FL § Q € S F K V HA 
AAGTCTATTATTCTTCCACAAGGTGGCTATGTGGCAGATTTTGCCTATTTTTTCCTAAGCCAGTGTAGCTTCAAAGTTCATGCT 


NW RCL K R F DS T LP GF ET IM K VULN E NUN A 
DSR OH ne Sy haan TELS ESTATE TAC NRG LD Teen e ATC TRACT GAT en Rr en ennT Onn 


Y QNQOODGGADVV S&S PK P QR KRGTKQKAQ K OD 
TACCAGAATCAAGATGGTGGTGCAGATGTAGTGAGCCCTAAGCCTCAGAGAAAGAGAGGGACAAAGCAAAAGGCTCAGAAAGAT 


Ev DN VS V AK PK S SVQRNV S$ REL T PED R S 
GAAGTAGATAATGTAAGCGTTGCAAAGCCCAAAAGCTCTGTGCAGCGAAATGTAAGTAGAGAGTTAACCCCTGAGGATCGCAGC 


LL AQYtLDODGVVPDG LE DDS NV @ 
CTTCTGGCTCAGATCCTAGATGATGGCGTAGTGCCAGATGGGTTAGAAGATGACTCTAATGTGTAAAGAGAATGAATCCTATGT 
CGGCACTCGGTGGTAACCCCTCGCGAGAAAGTCGGGATAGGACACTCTCTATCAGAATGGATGTCTTGCTGTCATAACAGATAG 
AGAAGGTTGTGGCAGACCCTGTATCAATTAGTTGAAAGAGATTGCAAAATAGAGAATGTGTGAGAGAAGTTAGCAAGGTCCTAC 


GTCTAACCATAAGAACGGCGATAGGCGCCCCCTGGGAAGAGCTCACATCAGGGTACTATTCCTGCAATGCCCTAGTAAATGAAT 


GAAGTTGATCATGGCCAATTGGAAGAATCAC-poly (A) -3' 
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Fic. 3. DNA sequence and deduced amino acid sequence of the DissE cDNA clones. The extreme 5’ end sequence was obtained by primer- 
extension studies (see Fig. 4). A translation of the main ORF is shown in single-letter amino acid code. Solid triangles indicate the sites where 
sequence fusion occurred. 
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10 20 30 40 50 
UAUAAGAGUGAUUGGCGUCCGUACGUACCCUCUCUACUCUAAAACUCUUG 


* 3K 
UAUAAGAGUGAAUGGCGUCCGUACGUACCCAAUCUACUCUAAAACUCUUG 


* ** 
UAUAAGAGUGAAUGGCGUCCGUACGUACCCAAUCUACUCUAAAACUCUUG 


60 70a g0_b 90 100 
UAGUUUAAAUCUAAUCUAAUCUAAACUUUAUAAACGGCACUUCCUGCGUG 
1 2 
UAGUUUAAAUCUAAUCUAAUCUAAUCUABACGGCACUUCCUGCGUGUCCA 
1 2 4 
UAGUUUAAAUCUAAUCUAAUCUAAACGGCACUUCCUGCGUGUCCAUGCCC 
1 3 


110 120 130 140 


MHV-JHM 
DIssE (a) 


DIssE (b) 


UCCAUGCCCGUGGGCCUGGUCUUGUCAUAGUGCUGACAUU 
UGCCCGUGGGCCUGGUCUUGUCAUAGUGCUGACAUU 


GUGGGCCUGGUCUUGUCAUAGUGCUGACAUU 


Fic. 4. Primer extension analysis of the 5'-end of DissE. {A) The synthetic oligodeoxyribonucleotides (18-mer) complementary to the nucleo- 
tides 123-140 from the 5’ end of the parental MHV-JHM genomic RNA (Shieh et a/., 1987; Soe ef a/., 1987) was **P-labeled at the 5’ end, 
hybridized ta the gel-purified DIlssE RNA, and extended with reverse transcriptase. The products were electrophoresed on 6% polyacrwiamide 
gels containing 8 M urea. O, origin of the gel. Two primer-extended products are shown as a and b. (B) The DNA sequences of these primer- 
extended products were determined by the Maxam-Gilbert method. The 5’-end sequence of MHV-JHM genomic sequence was obtained from 
previous studies (Shieh ef a/., 1987; Soe et a/., 1987). The letters a and b represent the canonical seven-nucleotide sequence UCUAAAC and 
imperfectly repeated sequence of UAUAAAC, respectively. A bold solid line represents the nine-nucleotide sequence which is deleted in DissE 
but present in MHV-JHM. DissE (a) and DissE (b) correspond to the sequences of primer-extended products, a and b, in Fig. 4A, respectively. 


Three base substitutions are indicated by asterisks. 


DI RNAs. We therefore examined whether any signifi- 
cant secondary structure existed at rearrangement 
sites on MHV-JHM genomic RNA. The nucleotide se- 
quences surrounding deleted regions of MHV-JHM ge- 
nomic RNA were analyzed by an RNA secondary struc- 
ture program of Zuker and Stiegler (1981). The pre- 
dicted secondary structures of these rearrangement 
regions are shown in Fig. 6. All four genomic deletion 
sites have extensive and stable secondary structures. 
The free energies of these structures range from —73.0 
to —114.2 kcal/mol. Furthermore, as previously de- 
scribed for the standard MHV-JHM, the sequence sur- 
rounding the junction of leader RNA and the remaining 
5’-end genomic sequence also contains a stable sec- 
ondary structure (Soe et a/., 1987). This junction region 
includes the nine-nucleotide deletion detected in 
DissE RNA (Fig. 4B). Thus, an extensive and stable 
secondary structure exists at each parental MHV-JHM 
genomic region where deletion occurred. 


DISCUSSION 


The present study demonstrated that the smallest 
Di-specific RNA, DIssE, is composed of three discon- 
tiguous parts of the viral genome, including the 5’ end 
and 3’ end of genomic RNA. This structure is similar to 
many DI RNAs of other viruses, which typically retain 
both ends of the standard nondefective viral RNAs. Our 
previous study has demonstrated that DIssE is repli- 
cated from its negative template in the presence of 
helper virus (Makino et a/., 1988). Therefore, the DIssE 
sequence likely contains essential recognition signals 
for MHV RNA replication. The structure of DIssE RNA 
supports the likelihood that the recognition signals for 
the synthesis of negative-strand RNA and positive- 
strand RNA are localized at the 3’ end and 5’ end of 
genomic RNA, respectively. 

One of the unique features of coronavirus DI RNA is 
that subgenomic D] RNA was poorly incorporated into 
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Fic. 5. Translation of DissE-specific proteins. (A) Translation in a rabbit reticulocyte lysate of gel-purified DissE RNA. *°S-Labeled jn vitro 
translation products of DissE were analyzed by SDS-polyacrylamide gel! electrophoresis directly (lane 1), and immunoprecipitated with anti-N 
protein monoclonal antibody (lane 2) or anti-p28 antibody (lane 3). Lane 4 contains '*C-labeled marker proteins. (B) DissE-specific proteins in 
Dl-infected cells. DBT cells were mock-infected (lanes 1 and 4), infected with MHV-JHM (lanes 2 and 5), or infected with MHV-JHM containing 
DI particles (Lanes 3 and 6). At 7.5 hr postinfection, cultures were labeled with [°°S]methionine for 30 min, and cytoplasmic lysates were 


prepared, immunoprecipitated with anti-N protein monoclonal antibody (lanes 1-3) or anti-p28 antibody (lanes 4-6), and electrophoresed. Lane 
7 contains '*C-labeled marker proteins. 


L A B c DEF G 
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Fic. 6. Predicted secondary structure at the sequence rearrangement sites of MHV-JHM genomic RNA. The sequence of MHV-JHM genomic 
RNA was obtained from previously published data (Soe et a/., 1987) and our unpublished data (Shieh et a/., unpublished data). A~G represent 
the seven genes of MHV RNA. Solid boxes correspond to regions which share with DissE. Free energy of the secondary structure at each 
rearranged site is given in kilocalories per mole. Arrows indicate the rearrangement sites. 
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virus particles (Makino et a/., 1988). One of the possible 
explanations is that the DI subgenomic RNAs lack a 
packaging signal. Since all MHV-specific subgenomic 
mRNAs contain the leader sequence, yet only geno- 
mic-sized RNA is efficiently packaged into virus parti- 
cles, the packaging signal is probably located in gene 
A but not in leader sequence. The present study indi- 
cates that DissE RNA has a nine-nucleotide (UUUAL- 
AAAC) deletion at the junction between the leader RNA 
and the remaining genomic RNA sequence. However, 
this deletion is not likely to account for the failure of 
efficient DI RNA packaging into virus particles since 
DissA and the genomic RNA of a mutant MHV-JHM, 
both of which are packaged into virus particles, also 
have similar nine-nucleotide deletions (S. Makino, un- 
published data). Thus, the packaging signals may be 
localized downstream of the 5’ end 864 nucleotides. 
Recently we found that another intracellular Dl-specific 
RNA, DIssF, could be packaged more efficiently than 
DIssE (S. Makino, unpublished data). The DissF RNA 
is approximately 1.7 kb larger than DIssE and appears 
to contain more gene A sequences than DIssE, as de- 
termined from T1-oligonucleotide fingerprinting (Ma- 
kino et a/., 1985). Sequence analysis of DIssF may re- 
veal the possible reason for the poor incorporation of 
DissE RNA into virus particles. 

The data presented in this paper demonstrate exten- 
sive and stable secondary structures in the standard 
viral RNA at sites where DI RNA underwent deletions. 
This observation is consistent with a model of DI RNA 
generation, in which RNA transcription is interrupted 
at sites of hairpin loops on the template, and the RNA 
intermediates then fall off and rebind at new sites on 
the template to generate an RNA with extensive dele- 
tions. We have previously suggested that coronavirus 
RNA synthesis may utilize a discontinuous, nonproces- 
sive mechanism, in which RNA transcription pauses at 
sites of secondary structures (Baric et a/., 1987). The 
incomplete RNA intermediates dissociate from tem- 
plates and then rejoin the temple for subsequent RNA 
transcription. This mechanism is supported by the 
findings that MHV can undergo RNA recombination at 
an extremely high frequency (Makino et a/., 1986a), 
and that free incomplete RNA transcription products of 
various sizes were detectable in the cytoplasm of 
MHV-infected cells (Baric et a/., 1985, 1987). Further- 
more, the sizes of these RNA products correspond to 
the lengths between the 5’ end and the sites of hairpin 
loops (Baric et a/., 1987), in agreement with the notion 
that transcription pauses at these hairpin loops. Thus, 
the potential hairpin loops present in the genomic RNA 
at the DI RNA rearrangement sites could have inter- 
rupted RNA transcription. The incomplete RNA tran- 


script may join the RNA template at the downstream 
rearrangement sites and create deleted RNA as a re- 
sult. However, there is no consensus sequence at the 
sites of RNA deletion and reinitiation. It is not known 
how the reinitiation of RNA synthesis occurred. 

The deletion of the nine nucleotides (UUUAUAAAC) 
at the 5’ end where the leader RNA joins the genomic 
RNA may have been caused by the same discontinu- 
ous and nonprocessive transcription mechanism. It is 
interesting to note that the UCUAAAC, which is the 
consensus sequence for the leader RNA binding (Shieh 
et a/., 1987), is imperfectly repeated (UAUAAAC) at 
nine nucleotides downstream (Shieh et a/., 1987). It is 
these nine nucleotides which were deleted in DIssE 
RNA. Similar nine-nucleotide deletions have also been 
noted in the genomic RNA of DissA, and that of an 
MHV-JHM mutant virus (S. Makino, unpublished data). 
This RNA structure suggests that RNA synthesis may 
pause at the first repeat, and then reinitiate at the sec- 
ond repeat because of the binding of the incomplete 
RNA transcript to the second repeat. Finally, the heter- 
ogeneity in the number of UCUAA repeats in DI RNAs 
also supports the discontinuous nature of coronavirus 
RNA replication. Similar heterogeneity has been noted 
in the genomic RNA of several different MHV strains (S. 
Makino and M. M. C. Lai, manuscript in preparation). 
Thus, DI RNA may be a product of discontinuous, non- 
processive RNA replication of coronaviruses. 

There was a significant difference between the ap- 
parent molecular mass of the DissE-specific protein 
products, 88K and 79K, and the predicted molecular 
mass of the potential product of the large ORF of DissE 
RNA. This difference could be due to unusual configu- 
rations affecting electrophoretic migration, or due to 
the presence of phosphorylation, since the N protein is 
phosphorylated (Stohiman and Lai, 1979) and protein 
translated jn vitro could be phosphorylated (Chatto- 
padhyay and Banerjee, 1987). A similar difference be- 
tween the predicted and actual molecular mass of the 
N protein has previously been noted (Skinner and Sid- 
dell, 1983). The relationship between the two protein 
species is not clear. The N protein has also been 
shown to consist of multiple species (Robbins et a/., 
1986). It is not clear whether these proteins play any 
functional roles in Dl-infected cells. Typically, DI RNAs 
do not synthesize any protein; however, in the Sindbis 
virus system, translation products have been detected 
from a DI RNA (Migliaccio et a/., 1985). 

Although MHV genomic RNA and DissE RNA are the 
major RNA species among MHV-specific mRNA spe- 
cies in virus-infected cells (Makino et a/., 1985, 1988) 
(Fig. 1), the gene products of these two mRNAs, RNA 
polymerase and both the 79K and 88K proteins, were 
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present in small quantities in virus-infected cells (Fig. 
5B). We have previously demonstrated that the pres- 
ence of stable secondary structure at the 5’ end non- 
coding regions of the polymerase gene reduced the 
amount of polymerse protein synthesized in vitro (Soe 
et al., 1987). Also, as discussed previously, the pres- 
ence of the small ORF encoding eight amino acids (Fig. 
3) may reduce the number of ribosomes reaching the 
downstream optimal translation site (Soe et a/., 1987). 
Since DIssE RNA has a 9’ end structure similar to that 
of genomic RNA, the DIssE RNA may provide a tool to 
better our understanding of the mechanism of transla- 
tional control of MHV RNAs. Furthermore, the fusion 
protein synthesized by DIssE RNA may be useful for 
understanding the functional and structural domains of 
the MHV polymerase and N protein. 
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